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Using a large-scale hydrodynamic simulation with heuristic criteria for 
galaxy formation, we investigate how the galaxy field is related to physical 
parameters, such as the mass density and the gas temperature. In our flat Cold 
Dark Matter model with Q = 0.37, we find that the relation between the galaxy 
and mass density fields is a function of scale. The bias b(R) = a g (R)/a(R), 
where u g {R) is the variance of galaxy counts in spheres of radius R and cr(R) is 
the same for mass, varies from 2.6 at 1 h^ 1 Mpc to 1.2 at 30 h~ x Mpc. Including 
the dependence of the galaxy density on local gas temperature as well as on 
local mass density can fully account for this scale dependence. Galaxy density 
depends on temperature because gas which is too hot cannot cool to form 
galaxies; this causes scale dependence of b(R) because local gas temperature 
is related to the gravitational potential, and thus contains information about 
the large scale density field. We show that temperature dependence generally 
causes b(R) to vary on quasilinear and nonlinear scales, indicating that scale 
dependence of bias may be a generic effect in realistic galaxy formation scenarios. 
We find that the relationship between the galaxy and mass density fields is 
also a function of galaxy age. On large scales, the older galaxies are highly 
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biased {b rs 1.7) and highly correlated (r = (55 g )/<j(r g ~ 1.0) with the mass 
density field; younger galaxies are not biased {b « 0.8) and are poorly correlated 
(r rs 0.5) with the mass. We argue that linear bias is inadequate to describe 
the relationship between galaxies and mass. We present a more physically based 
prescription which better fits our results and reproduces the scale dependence 
of the bias: p g /{p g ) = L(p/(p)) M (l + T/40, 000 K) N , where L = 1.23, M = 1.9, 
and N = -0.66. 



Subject headings: large-scale structure of universe, galaxies: formation 



1. Motivation 

Imminent large-scale galaxy redshift surveys such as the Sloan Digital Sky Survey 
(SDSS; |Gunn fc Weinberg 1995| ) and the Two-Degree Field (2DF; |Colless 1998Q will 
probe the galaxy density field of the universe with unprecedented precision. Were galaxies 
accurate tracers of the mass density field, the results of these surveys would put severe 
constraints on cosmological models. If the Cold Dark Matter (CDM) picture for the 
linear theory power spectrum is correct, these surveys can in fact measure cosmological 
parameters, such as the mass density f2 , the vacuum density A, and the baryon density fi& 
( |de Laix fc Starkman 1998] ; [l'egmark et al. 19"9~5| ; |Goldberg fc Strauss 1998| ; |Wang, Spergel 



|fc Strauss 1999|) . However, the visible matter in galaxies is only a small percentage of the 
baryons in the universe, which in turn is a small percentage of the mass in the universe 
( Fukugita, Hogan, fc Peebles 1998j ; |Cen fc Ostriker 1998b|) . Moreover, the process of galaxy 



formation is complex and nonlinear, including a complicated interplay between gravitational 
fields, hydrodynamics, microphysics and star formation. Does this complicated process 
produce a population of galaxies whose number density field traces the mass density field 
perfectly? In this introductory section, we argue that observations already suggest that it 
does not: that the relationship between the density fields of galaxies and mass is biased, 
scale dependent, and nonlinear, as well as dependent on morphological type. This discussion 
provides the observational motivation for asking the theoretical question: how is the galaxy 
density field related to that of the mass? 

The crucial element of our argument is that different morphological types of galaxies 
have different density fields ( [Hubble 1936j ; |Oemler 1974j ) . Consider the observed differences 



between the clustering strengths of galaxies of different types. Various authors have 
compared elliptical and spiral galaxies, generally finding that the fluctuation amplitude 
of ellipticals is stronger than that of spirals by a factor of 1.3-1.5 ( pavis fc Geller 1976] ; 
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Giovanelli, Haynes, fc Chincarini 1986 ; |Santiago fc Strauss 1992 ; Loveday et al. 1996| : 



Hermit et al. 1996| ; puzzo et al. 1997]) . Similarly, a comparison of the galaxy distribution in 



the IRAS redshift survey ( |Strauss et al. 1992 b| ) with those in the Center for Astrophysics 



Redshift Survey (CfA; |Huchra et al. 1983] ) and in the Optical Redshift Survey (ORS; 
Santiago et al. 1995|) , shows that optically-selected galaxies are clustered more strongly than 



infrared-selected galaxies by a similar factor ( Pavis et al. 1988| ; [Babul fc Postman 1990| 



|Strauss et al. 1992a| ). These differences in the amplitude of clustering may be accounted for 



by invoking a deterministic linear bias prescription: 

8 g (v) = b8(v), (1) 

where 8 g (r) = p g {r)/ (p g ) — 1 is the galaxy overdensity and <5(r) = p(r)/ (p) — 1 is the mass 
overdensity, smoothed on some scale. To explain the observations, one must assume that 
different galaxy populations are "biased" by different factors 6; clearly all but one of these 
bias factors must differ from unity. Furthermore, there are differences between the shapes 
of correlation functions of galaxies of different types, at least at small scales. For instance, 
the ratio of the correlation functions of ellipticals and spirals found by [Hermit et al. (1996) 



and puzzo et al. (1997) declines with scale over the range between 1 and 10 h 1 Mpc. This 



scale dependence cannot result from a deterministic linear bias. Therefore, there must exist 
a more complicated relation between the density fields of different morphological types. 

Alternatively, consider the density-morphology relation, quantified by Pressler (1980)| , 
Postman fc Geller (1984)] , and [Whitmore, Gilmore, fc Jones (1993)| . In the field, spirals 



comprise about 70% of all galaxies, and ellipticals and lenticulars comprise the rest; in the 
cores of rich clusters the situation is reversed, and ellipticals and lenticulars account for 
90% of all galaxies. The relationship between spiral density and the density of all galaxies 
is extremely nonlinear in the densest cluster regions. 

These differences among different morphological types suggest that the relationship 
between all galaxies and mass is comparably complicated. After all, it would be a 
coincidence if the overdensity field of all galaxies exactly traced the full mass overdensity, 
despite the fact that the different morphologies have formed at different times and with 
different efficiencies. In any case, the selection effects of redshift surveys (color, surface 
brightness, luminosity, etc.) will cause any catalog to contain a mix of morphological types 
that differs from the mix in a volume-limited sample. Since the overdensity fields of different 
morphologies have different density fields, the results of every survey are "biased" to some 
degree. It is thus interesting to explore theoretically how the mass density in the universe 
might be related to the galaxy density, and to the density of different morphological types. 



One approach is to express the galaxy density as a local transformation of the dark 
matter density (or other variables), the simplest version of which is the deterministic 
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linear bias of Equation (|l|). Peaks-biasing ( |Bardeen et at. 1986Q and threshold biasing 
( |Kaiser 1984| ) were the first suggestions along these lines. Indeed, pavis et at. (1985) 
used the peaks-biasing scheme to reconcile an Q = 1 universe with observations. Fry 



fe Gaztariaga (1993) , Uoles (1993) , Mann, Peacock, fc Heavens (1998] , and IN arayanan^ 



Berlind, fc Weinberg (1998 )| have studied how nonlinearity and other complications in this 
transformation can affect statistics such as the density distribution function, the correlation 
function, and M/L ratios of clusters. Another possibility is that the local transformation is 
not deterministic; (Scherrer fc Weinberg (1998)| show how such a stochastic relation affects 
the correlation function and the power spectrum of galaxies. |Dekel fc Lahav ( 1998 )| have 



developed a general formalism (which we adopt here) for expressing such a nonlinear and 
stochastic relation. 

A second approach is to use semi-analytic models to simulate galaxy formation 
(e.g., [White et at. 19871 ; IWhite fc Frenk 1991| ; |Cole et at. 1991 ; Kauffmann et at. 1993, 
1998; |Somerville fc Primack 1998|) . These methods use either iV-body simulations or the 
formalism of Press fc Schechter ( 1974"]] and [Mo fc White ( 1996 )| to follow dark matter 
halos and their merging histories. Simple rules for star formation and feedback follow 
the evolution of galaxies inside these halos. [Kauffmann, Nusser, fc Steinmetz (1997)| and 
[Kauffmann et at. (1998)| used such models to study the relation between mass density and 



galaxy density. We compare their results to ours below. 

The third approach, which we adopt here, is to use hydrodynamic simulations with 
heuristic models of galaxy formation built into them. Generically, these simulations 
follow the radiative physics of the gas and use physically motivated prescriptions to 
convert baryonic fluid into collisionless stellar particles, which is necessary due to our 
ignorance of the details of the star-formation processes and the finite resolution of the 
code. |Oarlberg fc Gouchman (1989)| , |Evrard, Summers, fc Davis (1994]| , and Katz, 
Hernquist, & Weinberg (1992, 1998) have all used the Lagrangian smoothed particle 
hydrodynamic (SPH) method to address the question of galaxy bias; Gnedin (1996a,b), 
Cen & Ostriker (1992, 1998a), and this paper use Eulerian methods for the same purpose. 

In this paper, we examine the galaxy density field produced by the large-scale 
hydrodynamic simulations of |Gen fc Ostriker (1998a)| . With recent improvements in 
computational methods and in computer hardware, the dynamic range in these simulations 
is approaching the level at which one can examine the formation of galaxies in a cosmological 
context. In Section |2|, we present the formalism of |Dekel fc Lahav (1998)] for expressing 
the relation between galaxies and mass, and review some previous theoretical results. In 
Section |||, we present details of the numerical simulations of |Cen fc Ostriker (1998a)| . In 



Section [|, we examine the relation between the distributions of galaxies and mass in these 
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simulations, as a function of smoothing scale and age. In Section |5], we show that one 
can explain the properties of the galaxy density field more completely by allowing for its 
dependence on gas temperature as well as on mass density. We present an analytic fit to 
our results which describes the galaxy density field well. In Section we show that the 
dependence on gas temperature explains the scale dependence of the bias. Finally, we 
introduce a toy model which reproduces some of the salient properties of the galaxy density 
field, and shows how scale dependence could be a generic property of galaxy formation. We 
discuss some directions for future work in Section [7|. 



2. Formalism 



In this section, we present a general formalism developed by |Dekel fc Lahav ( 1998 )| for 



expressing the present-day Eulerian relation between the galaxy and mass density fields 
smoothed on a local scale Rq. Such an approach, which considers only the density fields 
at z — 0, of course does not account for the fact that two regions which have similar 
properties now may have had significantly different histories. We relate this formalism to 
the simple approximation that the galaxy and mass densities form a bivariate Gaussian 
distribution. Finally, we describe how certain complications allowed by this formalism, 
namely nonlinearity and stochasticity, can cause the relation between the two density fields 
to be a function of scale. This formalism is not a theoretical model in its own right, but 
rather gives a useful framework for quantifying all relevant aspects of galaxy bias. 

We begin by defining P(5 g \Xi) as the conditional probability of a certain overdensity of 
galaxies 5 g = p g / (p g ) — 1, given the set of local conditions expressed by X iy where the X t can 
be mass density, temperature, velocity shear, or any other present-day parameter thought 
to be relevant to galaxy formation. We use this probabilistic formalism despite the fact that 
galaxy formation is a deterministic process, for two reasons. First, two regions with similar 
properties at z = may have had quite different histories; these differences may make it 
impossible to find a perfectly deterministic relation between 5 g and other Eulerian variables. 
Second, even if such a relation did exist, one might not be able to identify the correct set 
of Eulerian variables with which to express it. Choosing a wrong or incomplete set of X{ 
would cause the relation to have large scatter. Because this resulting stochasticity would in 
fact have a physical basis, it would be likely to have interesting statistical properties, such 
as spatial correlations. Thus, one of our goals will be to find a set of X% which minimizes 
this scatter. 
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Having defined the conditional probability, one can define the conditional mean: 

{5 g \Xi) = JdS a 5 a P(5 g \Xi). (2) 
The variance of the scatter about this mean is: 

<t = (* t ) = ({8 a -(6 g \X i ))*), (3) 

where, as indicated, e represents the residuals of the conditional mean galaxy density at 
scale Rq. The quantity <7b/cr g , where a 2 = (8g), expresses the degree of stochasticity in 
the relation between galaxies and the variables Xj. Typically, investigators in this field 
have assumed that the most important (if not the only) local condition worth considering 
is the mass overdensity — 8 = p/(p) — 1 (an exception being [Narayanan, Berlind, fc 
|Weinberg 1998| ). In this case, one considers the conditional probability P(8 g \8) and the 
corresponding conditional mean (8 g \8), which is meant to summarize the relation between 
galaxies and mass on the given scale Rq. 

If the joint distribution of 8 and 8 g is a bivariate Gaussian at scale Rq, an appropriate 
approximation in the linear regime, the joint probability can be expressed as: 

l ay-2(88 g )88 g + ( r^ \ 
2 aV^ - (88 g y )' U 

We will refer to this special "linear bias." Such a distribution can be completely 

characterized by three quantities: o 2 = (8 2 ), b = cr g /a, and r = (88 g )/aa g , and indeed, 
this motivates the definitions of these quantities for arbitrary P(8,8 g ). The quantity b, 
hereafter, the "bias," compares the r.m.s. amplitude of the galaxy overdensity to that of 
the mass overdensity. The quantity r, hereafter the "correlation coefficient," expresses 
how closely the galaxy density field traces the mass density field. That is, if r = ±1, the 
relation between mass and galaxies is deterministic; if r = the galaxies are distributed 
independently of the mass. Note that the Gaussian assumption implies: 

(S g \5) = brS; 

Cfe/og = Vl — r 2 (Gaussian model). (5) 

In the nonlinear regime, the density field is far from Gaussian; however, calculating 
the second moments b and r will still give useful information on how the galaxy and mass 
density fields relate. The quantity br for any given pair of galaxy and mass density fields is 
the slope of a linear regression of 8 g on 8. Similarly, b/r is the slope of the linear regression 
of 8 on 8 g . The quantity y/1 — r 2 is a measure of the scatter around either regression. The 



P(8,8 g 
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(88 n y 
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scatter can occur either because nonlinearities make a straight line a poor approximation to 
(S g \5), or because of stochasticity. As pekel fc Lahav (1998)| note, the ratio a&/ (cr g \/l — r 2 ) 
measures the contribution of stochasticity (as opposed to nonlinearity) to the total scatter 
around the linear regression br. 

In the context of the formalism presented here, let us examine how scale dependence 
in the relation between galaxies and mass may arise. Most work to date assumes that 
the deterministic linear bias of Equation (0) holds (i.e. r = 1). Such a relation is scale 
independent; the same factor b applies at all scales. However, (S g \8) may in general be 
a nonlinear function of 5. In fact, on scales at which a 3> 1, this nonlinearity inevitably 
results from the condition 5 g > —1 (as long as (8 g \5) does not exactly equal 5). A simple 
approach is to expand 5 g in a Taylor series around 5 ( |Fry fc Gaztanaga 1993| ): 

(5 g \5) = M + |(5 2 - (6 2 )) + |(5 3 - <5 3 » + • • • (6) 

The introduction of nonlinearity opens the door to scale dependence; if 5 g and 5 are 
smoothed on a scale above Ro, the coefficients in Equation (|j) may change. On the other 
hand, Scherrer fc Weinberg ( 1998^1 show that for hierarchical clustering, even in the presence 



of nonlinear bias on small scales, b(R) is independent of R on large scales. 

In addition to being nonlinear, the relation can be stochastic, such that ^ 0. 



Scherrer fc Weinberg (1998)| have found that b is independent of scale in this case as well. 
However, they assumed that the residual field e about (S g \5) is spatially uncorrected. 
However, the scatter e may have a physical basis; thus, it is possible that e correlates with 
the large scale density field. If such a correlation existed, b(R) would be a function of scale. 
To illustrate this possibility, assume for the moment that at some small smoothing scale 
Ro, the joint 5 g -5 distribution is a bivariate Gaussian (Equation 4), with a bias of b(R ) 
and correlation coefficient r(R ). We define e = 5 g — br5 on this scale. We can subsequently 
smooth 5 g , 5, and e over a large scale R 3> -Ro- In this case: 

m = iW(Ro) + (' 2 >«+ 2 "(Wfl.)(^ (7) 

a 2 {R) 

As R — > Rq, by definition we have (Se) R w and (e 2 )^ w a 2 (i? )(l — r 2 (Ro)), so that 
b{R -> #0) = &(-Ro) as necessary. If (Se) R or (t 2 ) R varies on larger scales, clearly b(R) 
will vary as well. We show below that this variation can result from fairly simple physical 
considerations; indeed, the effects are strong in the simulations considered here. 



In the following sections, we will describe the numerical simulations of |Oen fc 



Ustriker (1998a)| and present the results in the context of the formalism presented here. As 



the independent variable X i} we at first use the traditional dark matter overdensity 5, and 
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find that the dependence on S cannot completely characterize the galaxy density field. We 
will find that using the description P(S g \S, T), where T is the local gas temperature, gives 
a much more satisfactory description of the galaxy density field. In particular, accounting 
for the dependence on T (or its counterpart, the dark matter velocity dispersion (v 2 )) also 
accounts for most of the scale dependence of the bias. Thus, temperature dependence 
causes stochasticity in the galaxy-mass relation, and this stochasticity is correlated with 
the mass distribution over large scales, causing b(R) to vary with scale. 



3. Simulations 

For these simulations, the work of Pstriker fc Steinhardt (1995)| motivated the choice 



of a flat cold dark matter cosmology with Q = 0.37, Q\ = 0.63, and Qf, = 0.049. 
Recent observations of high-redshift supernovae have lent support to the picture of a 
flat, low-density universe, though great uncertainty still remains ( Perlmutter et al. 1997 



Garnavich et al. 1998| ). The Hubble constant was set to H = 100 h km s" 1 Mpc" 1 , with 



h = 0.7. The primordial perturbations were adiabatic and random phase, with a power 
spectrum slope of n = 0.95 and amplitudes such that erg = 0.8 for the dark matter at z = 0, 
at which time the age of the universe is 12.7 Gyrs. We use a periodic box 100 h~ x Mpc on 
a side, with 512 3 grid cells and 256 3 dark matter particles. Thus, the mass resolution is 
about 5 x 1O 9 M and the grid cell size is ~ 200 h~ x kpc. The smallest smoothing length 
we consider is a 1 h~ l Mpc radius top hat, which is considerably larger than a cell size. On 
these scales and larger, the relevant gravitational and hydrodynamical physics are correctly 
handled. On the other hand, subgrid effects such as the fine grain structure of the gas and 
star formation may influence large-scale properties of the galaxy distribution. As we will 
decribe, we handle these effects using plausible, though crude, rules. 

Gen & Ostriker (1998a)' describe the hydrodynamic code in detail; it is similar to but 
greatly improved over that of |Cen fc Ostriker (1992a)| . The simulations are Eulerian on 



a Cartesian grid and use the Total Variation Diminishing method with a shock-capturing 
scheme. In addition, the code accounts for cooling processes and incorporates a heuristic 
galaxy formation criterion, whose essence is as follows: if a cell's density is high enough, if 
the cooling time of the gas in it is shorter than its dynamical time, if it contains greater 
than the Jeans mass, and if the flow around that cell is converging, it will have stars forming 
inside of it. The code turns a fraction of the baryonic fluid component into collisionless 
stellar particles (hereafter "galaxy particles"), which subsequently contribute to metal 
production and the background ionizing UV radiation. The masses of these galaxy particles 
range from about 10 6 to 10 9 M Q . Thus, many galaxy particles are contained in what 
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would correspond to a single galaxy in the real universe. Instead of grouping the particles 
into galaxies, we simply define a galaxy mass density field from the distribution of galaxy 
particles themselves. Thus our results will not be directly comparable to observations of the 
density field based on galaxy counts. 

Ideally, we would like to study the properties of 5 g for different galaxy morphologies. 
Our simulations clearly do not have the resolution necessary to determine the morphology of 
individual galaxies based on their internal structure. However, there is a rough correlation 
between a galaxy's morphology and its star-formation history ( [Roberts fc Haynes 1994fc 
Kennicutt 1998| ). Keeping this in mind, we can examine the simulations for an age-density 



relation in analogy to the morphology-density relation of |Dressler (1980)| and |Postman fc| 



Cjeller (1984)| . |(Jen fc (Jstriker (1993)| did so, finding qualitatively that the oldest galaxies 



had fallen into clusters and that the younger galaxies were forming in the lower density 
regions. In addition, Pen fc Ostriker (1998a)| have examined the dependence of the power 
spectrum on galaxy age. Here, we visit the problem again, now looking in more detail at 
the joint distribution of the mass density and the density of each galaxy population. We 
split the galaxy particles into four age quartiles, defined such that the total mass of galaxy 
particles in each quartile is the same. These age quartiles are not meant to be taken as 
literally corresponding to different morphologies, since there are other variables besides age 
which determine galaxy type; however, their differences should at least be indicative of 
the differences between different galaxy morphologies. The age and redshift ranges of the 
quartiles are given in Table [l]. As we examine the properties of the full galaxy mass density 
field, we do the same for the density fields of each age quartile. 



Figures [Ta] and [TBI show a slice through the galaxy and dark matter density fields 50 



h~ x Mpc on a side, at 1 h 1 Mpc and 10 h 1 Mpc smoothing, respectively. From top left, in 
clockwise order, we show the quantity 5 for the dark matter, all of the galaxy particles, the 
youngest galaxy quartile, and the oldest galaxy quartile. For each of these, 5 is normalized 
to the mean density of the sample in question, so that in the absence of biasing, these plots 
would be identical. Evidently the galaxy distribution follows the dark matter distribution 
well, except in the underdense regions, which are completely empty voids in the galaxy 
particle distribution. It is apparent from these pictures that the youngest galaxy particles 
are distributed quite differently than is the mass. At small smoothing scales the effect is 
obvious only in the clusters; otherwise the young galaxies follow the dark matter. At large 
smoothing scales, however, the young galaxies are underdense in the clusters, and their 
density fields peak along the filaments. On the other hand, the oldest galaxies follow the 
mass distribution well on all scales, and are quite obviously biased. In the rest of the paper, 
we will quantify the differences among these various density fields. 
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4. Single- Variable Bias 

In this section, we study the relation between the galaxy and mass density fields, 
expressed by P(8 g \5) and (S g \5). We begin by smoothing the galaxy density field over 
several different scales and showing that the relationship between galaxies and mass is a 
function of scale. Then we show that it is the properties of the scatter about (S g \S) at small 
scales, and not the form of (5 g \5) itself, which causes this scale dependence. 



4.1. Galaxy Density vs. Mass Density 

First we directly compare the density field of dark matter to that of galaxies. We do so 
by plotting in Figure || the conditional probability P(l + 5 g \l + 5) using top hat smoothing 
filters of six different radii: 1, 2, 5, 8, 16, and 30 h~ l Mpc. We use 1 + 5 here simply for 
convenience in plotting the results. The greyscale in this figure is a logarithmic stretch of 
P(l + 5 g \l + 5); the greyscale for each column is normalized separately. Note that for small 
smoothing scales, the discreteness of the dark matter particles limits our measurement of 5 
in voids. The vertical dashed line is the density corresponding to about 50 particles within 
one smoothing length, below which this effect becomes important. Note further that there 
is structure in the histograms at large smoothing scales, since there are many bins in the 
histogram but only a few truly independent values in the periodic volume. We plot (S g \5) 
for each smoothing scale as the solid black lines; the la deviations from this mean line are 
shown as the dotted black lines. It is immediately apparent that this function is nonlinear, 
and that there is large scatter about it. 

The same comparison can be made for the galaxies in each age quartile separately, as 
is shown in Figures |3~a] and |3~b] for top hat smoothing filters of two radii: 1 and 30 h^ 1 Mpc. 



The overdensity 8 g for each quartile is defined by normalizing to the mean density of that 
quartile. Note that there is a tight and highly biased relation between the distributions of 
older galaxies and the dark matter. On the other hand, the relation between the youngest 
galaxies and the mass is quite stochastic, even at the largest scales. At small scales, (S g \S) 
for young galaxies is not monotonic; as in the real universe, young galaxies rarely live in the 
highest density regions. This trend with age is easily understood. The densest regions of 
the universe are in the deepest potential wells, so they have the hottest gas with the longest 
cooling times; thus, this gas stopped forming galaxies some time ago. As a consequence, on 
average, the galaxies in the densest regions are older than galaxies elsewhere. 



One can quantify the relation between galaxies and mass by calculating second 
moments of the galaxy-mass distribution. Thus, at each smoothing scale, we calculate the 
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parameters b = o g ja and r = (S g 5)/cr g a and list their values in Table |2|. Note that b declines 
strongly with scale, from 2.6 at 1 hr x Mpc to 1.2 at 30 h" 1 Mpc; this behavior is consistent 
with the work of |Uen fc Ustriker (1992b)| . Meanwhile, r ~ 0.9 almost independent of scale, 
meaning that galaxies are well-correlated with mass. Similarly, we can calculate c^, the 
variance of the scatter about (6 g \5), and find that crb/o~ g ~ 0.3-0.4. For comparison to r, we 
list the ratio o^jo g \J\ — r 2 , which is ~ 0.8-0.9 at all scales, indicating that stochasticity, 
rather than nonlinearity, dominates the scatter around the linear regression slopes br and 
b/r. 

The parameters b and r will also clearly depend on what age galaxies one considers, 
as Figures |3a| and |3l] indicate. In Table [5] we list b and r for each of the quartiles and 
smoothing scales used in Figures |3a] and |3b]. Again, the dependence on scale is evident. 
However, more prominent are the differences between galaxies of different ages. Older 
galaxies are much more highly biased than young galaxies at all scales. In addition, older 
galaxies are more correlated with the mass distribution; r ~ 0.9-1.0 for the oldest galaxies, 
while r ~ 0.5 for the youngest galaxies. These results are agree with observations that 
early-type galaxies are biased relative to late-type galaxies. The low correlation coefficient 
for young galaxy particles means that fluctuations in their density field are poorly correlated 
with fluctuations in the mass density field, as one can see from Figures [la] and especially 
|TE| ; that b is near unity on large scales merely indicates that the fluctuations in the two 
fields are of similar amplitudes. 



4.2. Scale Dependence 

To study more carefully the scale dependence of the bias, we show b(R) in Figure f| and 
r(R) in Figure [5[ The solid black lines refer to all the galaxies; the dashed lines refer to each 
age quartile, as labeled. These curves show the same behavior found in Tables [2] and |3[ The 
older galaxies are more biased than the younger galaxies. Furthermore, the density field of 
the oldest quartile is extremely well-correlated with the mass density field, while that of 
the youngest quartile is extremely poorly correlated. Meanwhile, bias declines with scale 
for all the galaxies and for each quartile. Other investigators have published comparable 
results, and it is appropriate here to address the similarities and differences between the 
current results and those of others. We first note that these results are qualitatively similar 
to the previous results of |Cen fc Ostriker (1992b)| using the same method but with different 



cosmologies. We also, for the purposes of this discussion, define b^(R) = £ 9 (i?)/£(.R), the 
ratio of the galaxy and mass correlation functions on scale R. It is roughly, though not 
exactly, comparable to our definition b = o g jo. 
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The most recent iV-body results, using the Adaptive Refinement Tree (ART) method 
( [Kravtsov, Klypin, fc Khokhlov 1997] ) , indicate, in fact, that there is antibias at small 
scales, and that the bias increases with scale ( [Kravtsov fc Klypin 1998] ). This effect seems 
to be due to the merging and destruction of halos in dense regions. [Klypin et al. 1991? 



claim that ART has sufficient dynamic range to avoid the "overmerging" problem that 
has plagued pure iV-body simulations ( [White 1976| ; [Frenk et al. 1988| ; [van Kampen 1995 



|Summers, Davis, fc Evrard 1995| ; [Moore, Katz, fc Lake 19981) . Since we track stellar mass 
density, rather than galaxy number density, we cannot address the questions of mergers and 
destruction, and thus these effects are not apparent in our results. 



On the other hand, the results of the SPH simulations of |Uarlberg fc Uouchman (1989) 



Katz, Hernquist, fc Weinberg (1992)| , and |Evrard, Summers, fc Davis (199^)] all show a 
slight dependence of b^(R) on R between 1 and 10 h~ x Mpc, and a substantial increase 
of the bias on smaller scales (~ 0.3 h~ l Mpc). These results are in qualitative agreement 
with ours, and it is likely that the physical effects (described in Section ||) which cause the 
scale-dependence in our simulations are also important in the SPH simulations, since the 
criteria for producing galaxies in those simulations are similar to those in ours. 

Kauffmann, INusser, & Steinmetz (1997)' used semi-analytic modelling techniques 
combined with A-body simulations to explore the relationship between the clustering 
of galaxies and mass. Their method has the advantage that it can efficiently explore 
parameter space; however, it cannot model the gas dynamics or the effects of environment. 
For galaxies with Mb < —20 in an open = 0.2 CDM model, they find a similar scale 
dependence of the bias to ours (their Figure 6; note that their definition of "bias" is close to 
br, the regression of galaxy density on mass density, in our notation). For the Qq = 1 CDM 
model, they find that the scale-dependence is present but less acute. On the other hand, 
Kauffmann et al. (1998)| shows = £ s (r)/£(r) < 1 on small scales and increasing to larger 
scales for their ACDM cosmology, and b slightly greater than one for the rCDM cosmology. 
Kauffmann et al. (1998)| normalize their galaxy formation parameters somewhat differently 
than do Kauffmann, Nusser, fc Steinmetz (1997)| , such that galaxies of the same luminosity 
are in lower mass halos in the later work; this accounts for part of the difference between 
their results. In sum, these semi-analytic models, which have a vastly different model of 
galaxy formation from ours, can in some cases produce scale-dependence similar to that 
found here. 

We are interested in discovering exactly why bias decreases with scale in our results. 
Scale dependence can be due to two things only: the nonlinearity of the locally defined 
(S g \5), or the properties of the field of residuals e about that mean. Several theoretical forays 
suggest that the first possibility cannot be the case QColes 1993| ; (gcherrer fc Weinberg 1998] ; 



13 



Dckcl fc Lahav 1998 ). We can test these results by applying the (8 g \S) defined on small 
scales to the mass density field. Specifically, we calculate (S g \S) for the galaxy and mass 
fields smoothed with a 1 h^ 1 Mpc radius top hat; then, at each grid cell, we check the value 
of 5 and set the galaxy density in that cell to the appropriate value of (5 g \S). We refer to 
the resulting field as the "fake galaxy" density field. 

We can now consider the statistic b(R) for this fake galaxy field, which we determined 
using the mass density field alone, shown as the dotted lines in each panel of Figure |6|. 
Here, R takes into account the 1 h" 1 Mpc smoothing already present in S. The solid lines 
are the actual b(R) for the galaxies found in the simulations, from Figure The top panel 
corresponds to all the galaxies, the middle panel to the oldest galaxies, and the bottom 
panel to the youngest galaxies. This procedure does not reproduce the behavior of bias as a 
function of scale, at least when we use this single- variable model (5 g \S). There is some scale 
dependence, which is allowed by the results of Scherrer fc Weinberg (1998] in the nonlinear 



regime. At small scales, the fake galaxy distribution has less power than the actual galaxy 
distribution, while at large scales, the fake galaxy distribution remains much more biased 
than the actual galaxy distribution. Thus, the mean relation (S g \S) between galaxies and 
mass at small scales does not contain all the information about S g . 

The remaining possibility is that the properties of the residual field e about (S g \8) 
cause the scale dependence. These residuals must correlate over large scales in such a way 
that e < in large scale overdensities and e > in large scale voids. In that case, (Se) R 
will be negative on large scales and, as Equation (|7|) reveals, b(R) will decrease with scale. 
To test this possibility, we calculate e at 1 h' 1 Mpc top hat smoothing, and then smooth 
e and 5 on scale R in order to calculate (5e) R /a 2 (R). Figure [7| shows the behavior of this 
quantity. As one would predict, it is near zero when R ~ 1 h^ 1 Mpc, and becomes strongly 
negative on larger scales. The same holds for the youngest and oldest quartiles, also shown 
in Figure 0. This result indicates that, indeed, the residual field tends to be negative in 
large scale overdensities, and positive in large scale voids. 

In other words, the scatter about (5 g \S) has interesting statistical correlations. It 
depends on other variables which are important to galaxy formation. The dependence on 
and the nature of these variables must be such as to reproduce the scale dependence found 
in this section. In the next two sections, we investigate the dependence of galaxy density on 
a number of other variables, and find that accounting for the dependence of galaxy density 
on local temperature can both reduce the scatter in the residual field and explain the scale 
dependence of b. 
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5. Two- Variable Bias 

From the last section we learned that modeling the galaxy density field as a function 
of mass density alone was unsatisfactory. That is, spatial correlations in the scatter e are 
important. We would like to both reduce the scatter in our estimate of the galaxy density 
and to explain the scale dependence of b(R). To do so, we can add an independent variable 
to our conditional probability and consider the function P(5 g \8, JQ). Note that to account 
for the scale dependence, Xj must have spatial correlations similar to those of e; we discuss 
this point further in the next section. We try several Xi and find that the most successful is 
the local gas temperature T, or, equivalently, the local dark matter velocity dispersion. This 
result is perhaps not surprising, since gas temperature is surely an important parameter 
in galaxy formation. That is, in order to form stars, gas must be able to cool efficiently 
and collapse, which cannot happen at very high temperatures. In addition, we will find in 
Section [] that accounting for this dependence on T also accounts for the dependence of 
b(R) on scale. 

For each Xi we choose as a second independent variable, we want to know how much it 
reduces the stochasticity in the relation P(8 g \S, Xi). To quantify this, we calculate the ratio: 

(to, - («) 2 > l/2 ' 

which expresses the scatter of 5 g around (5 g \5,Xi), compared with the scatter around (5 g \5). 
Thus, if Xi is perfectly correlated with 5, or if it is perfectly uncorrelated with 5 g , then 
Cb^/vb ~ 1. In the next few paragraphs, we will substitute for the variable Xi the following: 
T, the local temperature; (t> 2 ), the local dark matter velocity dispersion; the prolateness 
and oblateness of the local dark matter density field; and the shear in the dark matter 
velocity field. Our results for cr^/crb f° r these variables are given in Table f|. 

The local temperature T is relevant to galaxy formation because gas which is too hot 
does not satisfy the Jeans criterion and cannot cool efficiently, and thus cannot collapse and 
form stars. Since this criterion is explicitly included in our conditions for the condensation 
of galaxy particles out of the gas, it is a reasonable second variable to investigate. Figure |] 
shows a contour plot of galaxy density as a function of mass density and gas temperature; 
all fields in this plot have been smoothed over 1 h^ 1 Mpc radius top hat spheres. First, 
note that local gas temperature and local dark matter density are clearly not independent 
variables; the upper left triangle in the figure is blank because there are no volume elements 
with high local density and low temperature on these scales. Nevertheless, one can see that 
galaxy density declines as the temperature increases at constant 5. Figure |9] shows 8 g as a 
function of T for 5 = 20, corresponding to a horizontal slice of Figure [8|. The strength of the 
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temperature dependence is evident. From Table |], we note that the inclusion of this extra 
variable reduces the variance by over a factor of two, indicating that variations in local 
temperature can account for a large portion of the stochasticity in the biasing relations of 
Figure |2|. In Figure |^ we also plot the dependence on temperature of the galaxy density in 
each age quartile, as the dashed curves. As one would expect, the density of the youngest 
galaxies is the most temperature-dependent; that is, in the hottest regions, there are no 
new galaxies forming, although plenty of old galaxies exist there, which formed there in the 
past or have since fallen in. We have performed the same analysis using a simulation 50 h^ 1 
Mpc on a side, with twice the resolution of our 100 h -1 Mpc simulation. The dependence 
of galaxy density on mass density and temperature is identical, except for S < 5, where 
galaxies are produced more efficiently in the higher resolution simulation. This difference 
has little effect on the quantities b and r, although the void probability function is affected, 
in the sense that voids are less likely in the higher-resolution simulation. 

A fairly good fit to the dependence of galaxy density on mass density and temperature 
on small scales is: 

We show such a fit in Figure [10]. As labeled, each pair of solid and dotted lines corresponds 
to a different value of 5. The solid lines are the results from Figure |8|; the dotted lines are 
the fit of Equation (|), using L = 1.23, M = 1.9, and N = -0.66. For N < 0, the factor 
involving T takes into account the fact that relatively fewer galaxies have formed in hotter 
regions; the form assumed here reflects the approximate power law dependence in Figures 
|9| and [It]. This effect is not important once the gas is as cold as 40, 000 K, and thus we 
construct the temperature factor to have little effect in that regime. 

We would like to be able to apply such a fit to iV-body simulations. Doing so would 
allow us to explore changes of cosmological parameters more easily than the expensive 
hydrodynamical simulations allow. Although we cannot follow gas temperature in purely 
collisionless simulations, we can calculate the related quantity (v 2 ), the local dark matter 
velocity dispersion. The dependence on (v 2 ) should be similar to that on local gas 
temperature. After all, in virialized regions the velocity dispersion of dark matter particles 
is close to that of individual atoms. Indeed, from Table f|it is clear that o^^job is nearly the 
same for the velocity dispersion as it is for the temperature. Note, of course, that taking 
into account the dependence of galaxy density on both (v 2 ) and T would not improve on 
using either one separately, as they are essentially equivalent variables. 

Next, we consider the oblateness and pro lateness of the density field in spheres of 1 hT 1 
Mpc, calculated from the eigenvalues of the local inertia tensor Ai, A 2 , A 3 ; the prolateness 
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is 2Ai/(A 2 + A 3 ) — 1 and the oblateness is (Ai + A 2 )/2A 3 — 1. As one can see from Table |, 
these quantities are not important to the galaxy density. Finally, we consider the shear in 
the velocity field, defined by: 

£ 2 = ^(S^) 2 , where 

Sjj = -(v i: j + v j:i ) - -v k ,kdij. (10) 

Clearly the shear will correlate well with the dark matter density, as the dark matter in 
the higher density regions will have higher velocities. To account for this, we consider the 
dimensionless quantity / H^il + 5). Another way to justify this scaling is to think of the 
S as a characteristic shearing rate, to be multiplied by the dynamical time, which scales as 
(1 + 5)" 1 / 2 . From Table [|, the galaxy density seems largely independent of the shear in 
the velocity field. Thus temperature, or equivalently velocity dispersion, seems to be the 
second parameter which best minimizes the stochasticity of the galaxy-mass relationship. 
There may be additional variables important to the process of galaxy formation, but this 
set of variables is fairly broad and includes the most likely candidates. 



6. Explaining Scale Dependence 

The last section revealed that local temperature is an important parameter for 
determining the local galaxy density. Is it possible that the temperature dependence can 
affect the scale dependence of b(R)l That is, as we pointed out in the last section, does the 
temperature have statistical properties similar to those of the scatter around the conditional 
mean (S g \5)l We can repeat the exercise of Section £| and calculate a fake galaxy density 
field, this time using the two-variable mean (S g \5, T) of Figure ||. We then calculate b(R) 
as before and plot it as the short dashed line in each panel of Figure |6|. The temperature 
dependence indeed beautifully accounts for the variation of b(R) with scale, for all the 
galaxies as well as for each quartile. The fit to the density distribution given by Equation 
(|) produces nearly identical results; it is shown as the long dashed line in the upper panel. 

What are the physical properties of the temperature field that cause this to happen? 
Essentially, it is that local temperature reflects the gravitational potential and thus contains 
information about the large scale density field. The temperature fluctuations 5t = Tj (T) — 1 
can be expressed in a simple way by considering some limits ( Spergel 1998| ). First, in the 



nonlinear regions, gas is virialized and its temperature must scale as the local potential. 
From Poisson's equation, one knows that 0(k) oc k~ 2 5(h). Thus, on small scales, it must be 
that &r(k) oc k~ 2 5(k) as well. Second, on linear scales one can assume that the temperature 
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fluctuations are dominated by the number density fluctuations of virialized haloes, since 
gas in those areas is much hotter than that in the empty regions between halos. Thus, on 
large scales &r(k) oc S(k). These two limits may be combined: 

where r n i is the transition scale between the linear and nonlinear regimes. Consequently, we 
expect the cross spectrum of temperature and mass to be: 

PTm{k) K l + k*r*JW (12) 

In Figure [TT], we compare this simple model with the simulations, finding that it is a good 
fit for the choice r n \ = 16 h^ 1 Mpc. This value for r n \ agrees approximately with the 
scale on which nonlinear effects should become important. Thus, the temperature power 
spectrum peaks at large scales; furthermore, at those scales the temperature fluctuations 
are directly related to the mass density. That means that the largest contribution to the 
local temperature actually comes from large wavelength fluctuations, which follow the large 
wavelength fluctuations in mass density. The local gas temperature is therefore a direct 
indicator of the large scale density field. Thus, accounting for the temperature dependence 
automatically accounts for the dependence on large scale density, and consequently the 
dependence of bias on scale. 

A simple model for the relation between galaxies, mass, and temperature reveals more 
explicitly how scale dependence and stochasticity enter the relation between galaxies and 
mass. Consider the fit given in Equation (|9|). If one assumes that 5 9 < 1, 5 < 1, and 
8t "C 1, this relation becomes: 

5 g = M'5 + N'8 T , (13) 

and one recovers the deterministic linear bias model if N' = and M' = b. This model is 
obviously highly unrealistic, especially at small scales. However, its simplicity will allow us 
to understand better how b(R) becomes scale-dependent. 

We can perform a linear regression on 5 and 5t to determine M' and N' . The results 



as a function of top hat smoothing scale are shown in Figure Notice that M' and N' 
are approximately constant with respect to scale. This invariance indicates that the scale 
dependence is well-accounted for by the temperature dependence. To examine this claim, 
let us assume that Equation ( |I~3"D holds at some small scale Rq. Now, if we smooth over a 
larger scale R, we find that: 

b y R) = £W = M >2 + N n^L + 2M'N' {66t)r (14) 
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Note that (55t) R /c 2 (R) and a^(R)/a 2 (R) will depend on R, because Equation (|TT|) shows 
that PTm(k) / P mm (k) and Prrik) / P mm {k) depend on k. Thus, b(R) = a g (R)/a(R) will 
also depend on R. We can test this possibility directly by applying the deterministic 
two- variable linear model to this simulation and calculating b(R). For this test we use the 
values M' = 2.4 and N' = —0.4, which are appropriate at 1 h^ 1 Mpc; the resulting curve is 
the dot-dashed line in the upper panel of Figure |6|. At small scales, the linear approximation 
is (as expected) poor but of the right order; on large scales, it reproduces the value of b(R) 
fairly well. 

Thus, even this simple linear model reproduces the scale dependence. We do not 
claim that this model is a particularly good one for describing the galaxy density; the fit of 
Equation (|9]) is much better. Instead, the linear model is merely a toy which illustrates the 
following point: given a dependence of galaxy density on local temperature, it is inevitable 
that b(R) is a function of scale on scales smaller than about r n i, simply because of the 
relationship between gas temperature and mass density. 

7. Discussion 

Consider two regions of the universe, both of which have the same local mass 
overdensity; however, with respect to the large scale density field, one is in an overdensity, 
the other in a void. The gas in these two regions will evolve similarly, forming galaxies with 
about equal efficiency, until the large scale density field becomes nonlinear. At this point, 
the ambient gas around the first region, in the large scale overdensity, will become too hot 
to accrete any longer, and galaxy formation will cease. Meanwhile, the gas around the 
second region, in the void, will remain cool enough to accrete onto old galaxies and continue 
to form new ones. This picture explains part of the scatter in the local relation between 
5 and 5 g ; it also explains why this scatter is such that in hot regions e = 5 g — (S g \S) < 0, 
and in cold regions e > 0. In turn, the correlation of temperature with large scale density 
explains why e also correlates with large scale density, causing scale dependent bias. In 
addition, it indicates qualitatively why spirals, which are caused by late-time accretion of 
gas, are relatively more abundant in the "field" and relatively underabundant in the rich 
clusters. 

In this scenario, scale dependent bias follows from only one rather robust assumption: 
that the ambient gas temperature affects the efficiency of galaxy formation. Since gas 
must be able to cool to form galaxies, this assumption is well-motivated theoretically. 
Furthermore, observations of the star-formation rate as a function of local density indicate 
that, indeed, star formation is reduced in the hot cluster environments, even at a fixed 
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morphological type (|Young et al. 1996 ; Hashimoto et al. 1997| ; Balogh et al. 1998 ). Once 



one concedes that local temperature is an indicator of the efficiency of galaxy formation, 
the arguments in Section ^| lead one to directly conclude that b(R) should depend on scale, 
at least in the quasilinear and nonlinear regimes, where PTm{k) and Ptt(&) have different 
shapes than P mm (k). In the simulations studied here, the effects are particularly strong. 

Scale dependence in the relation between galaxies and mass can affect the interpretation 
of future redshift and peculiar velocity surveys. The most obvious example is that on 
small scales, the shape of the galaxy power spectrum will differ from that of the mass. 
Furthermore, as pointed out by pekel fc Lahav ( 1998 )| , comparison of the observed galaxy 



density field to that inferred from observed peculiar velocities ( |Dekel 1994 ; |Sigad et al. 1998| ) 



effectively perform a regression of S on 5 g and thus measure the quantity (3 = (r/6)/(0), 
which we show here depends both on scale and on the chosen galaxy sample. Thus, the 
current analyses, usually performed in the quasilinear regime and using IRAS galaxies^ 
to define the density field, may be sensitive to these effects. For instance, one's estimate 
of r/b will generally increase with scale, by about 20% between 5 h~ x Mpc and 30 h^ 1 
Mpc. Amusingly, although r and b vary by a factor of two between the youngest and 
oldest galaxies on large scales, the dependence of r/b on galaxy age remains quite small; 
r/b ~ 0.6-0.7 for all four quartiles. In the real world, the regression of 5 on 5 g will most 
likely not be this constant among the morphological types. The decrease of (3 with scale 
could contribute to the differences between the results of [Sigad et al. (199871 an d of analyses 
carried out with smaller smoothing scale, such as VELMOD ( [Willick fc Strauss 1998j ); 
however, there are stark differences between these methods, and a more careful analysis is 
thus necessary. Indeed, we are interested in exploring the effects of nonlinear stochastic 
bias on a variety of large-scale structure statistics inferred from redshift surveys, including 
redshift-space distortions ( Pekel fc Lahav 1998| ) and the pairwise velocity dispersion as 



a function of local density ( [Strauss, Ostriker fc Cen 1998] ). A full treatment will require 
identifying individual galaxies from the galaxy particles in the simulations, which will 
require tackling the overmerging problem in the densest regions of the simulations. 

We can address some of these issues without having to run expensive hydrodynamic 
simulations for a range of cosmo logical models. First, in order to examine in detail the 
effect of the relation between galaxy density, mass density, and temperature on all of these 
statistics, we plan to carry out iV-body simulations of larger dynamic range than is possible 



3 IRAS galaxies are typically young, although they are probably not as young as our youngest quartile. 
In particular, IRAS galaxies do not show an underdensity in rich clusters that the youngest galaxies in the 
simulations do (Fig. ^), although they are less overdense than are optically selected galaxies (Strauss et 
al. 1992a). 
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with the current hydrodynamical simulations. To characterize the galaxy distribution 
in these simulations, we will apply the model of Equation (|9|), using (v 2 ) as a proxy for 
temperature. Then one can explore the effect this type of bias can have on the various 
statistics discussed in the last paragraph. A second approach is to analyze observations, 
allowing for scale- and temperature-dependent bias of the character described here. In 
this vein, one could investigate the differences between galaxy types and see how they 
compare in detail with the differences we find between our age quartiles. Using the local 
galaxy velocity dispersion as a proxy for temperature, we could even directly investigate 
the dependence of the density of difference morphological types on temperature. 

Of course, that temperature is the important causal variable in these simulations 
is based only on a post hoc (though physically plausible) argument in this paper. Any 
variable that probes the large-scale density field would serve just as well to reduce a^/cft 
and explain the scale dependence of b(R). However, as discussed above, temperature is 
a well-motivated quantity. Controlled tests of the effect of ambient temperature on star 
formation in these simulations might help clarify the matter. Another approach is to look 
at various output times and examine under what conditions in the simulations the galaxies 
actually form. We do so in a separate paper, which examines the time dependence of galaxy 
formation and bias in these simulations. That work makes it clear that the temperature 
dependence is a result of the Jeans mass and cooling criteria that the code uses to decide 
where galaxy particles condense out of the gas. 

The alert reader will notice that some of our results contradict our opening statements 
in Section [I]. In particular, if one looks at our results on large scales, it does happen to be 
true that the distributions of old galaxies and of young galaxies both differ considerably 
from the mass distribution, but in combination trace the mass quite well and are almost 
unbiased. We simply note here that the same would not be true were we to look at the 
results of these simulations at z = 0.5, for instance. In addition, pen fc Ostriker (19921)) 
found that for a hot dark matter universe (HDM), b was significantly greater than unity 
on large scales, indicating that the level of bias in these simulations depends somewhat on 
the chosen cosmology. Thus we ascribe little importance to this coincidence in the current 
simulations. 

Note also that these results concern stellar mass density, not galaxy number density. 
Since the galaxies in the densest regions of the simulations overmerge, to what degree the 
bias described here affects the brightness, rather than the number, of galaxies is unknown. 
Only if the stellar mass function of galaxies is universal will the bias found here translate 
directly into galaxy number density bias. On the other hand, previous theoretical results 
hint that the most massive halos form preferentially in large-scale overdensities ( [Mo fc 
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White 1996). Meanwhile, these massive halos may form fewer stars per unit mass than 
less massive halos flWhite fc Frenk 1991| ; [Katz, Hernquist, fc Weinberg 199^ ; [Evrard, 



Summers, fc Davis 1994| ; [Kautlmann et at. 1998] ). Observationally, [Bromley et at. (1998)| 



find that early-type galaxies are fainter in dense regions than elsewhere. The temperature 
dependence found in our simulations would certainly explain the last two effects, which do 
affect the brightnesses, but not the numbers, of galaxies. 

Several further notes of caution are in order concerning applying these results in 
detail. First, the simulations have limited resolution and do not probe physics on scales 
less than 200 h^ 1 kpc. Thus, we cannot follow individual halos in dense regions. As a 
check, we performed the same analysis on a 50 h~ l Mpc simulation of twice the resolution, 
finding consistent results in the regime 5 > 5 for the dependence of galaxy density on mass 
density and temperature, smoothed at 1 h~ l Mpc. On the other hand, the 50 h~ x Mpc 
simulations produced more galaxies in the regions with 5 < 5. Second, by necessity all 
the complications of the interstellar medium (ISM) and small scale dynamics are ignored. 
Surely the morphology of the ISM, its interaction with infalling gas, and its reaction to high 
star-formation rates is important to the formation and evolution of galaxies. Third, even 
to the extent that the simulation is physically accurate, the dependence of the results on 
cosmological parameters is unknown. Finally, we have not experimented here with varying 
the galaxy formation and feedback parameters, although this has been done in previous 
work ( |Cen fc Ustriker 1992b| ; Gnedin 1996a,b). We plan future simulations of higher 



resolution both to probe variations in cosmology, and to attempt to identify individual 
galaxies even in the denser regions of the simulation. In addition, we are working on 
methods of efficiently exploring the effects of varying the galaxy formation parameters. 

The general result of the simulations is that the relationship between mass and galaxies 
is interestingly complicated. Given the precision and volume of upcoming redshift surveys, 
it is possible that our understanding of the mass distribution on small scales will be 
limited by our ignorance of the properties of galaxy formation and of the origin of different 
morphological types. Hopefully, cosmological simulations such as this one can help us 
understand in what ways, and possibly to what degree, the galaxy distribution can differ 
from that of the mass. 
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Table 1. Age and redshift ranges of galaxy particle quartiles of equal total mass. 



Quartile 


Age Ran£ 


?e (Gyrs) 


Redshift Range 


Oldest 


9.6 - 


12.7 


1.9 — oo 


Second Oldest 


7.8 - 


- 9.6 


1.1 — 1.9 


Second Youngest 


5.7- 


- 7.8 


0.6 — 1.1 


Youngest 


- 


5.7 


— 0.6 



Table 2. Moments of total mass and galaxy mass density distributions. 



Top hat radius (h 1 Mpc) 


a 


b = a g /a 


r = (65 g )/aa g 




a b /a g y/l - r 2 


1.0 


4.77 


2.61 


0.886 


0.420 


0.905 


2.0 


2.67 


1.94 


0.902 


0.367 


0.850 


5.0 


1.14 


1.52 


0.923 


0.314 


0.817 


8.0 


0.754 


1.41 


0.936 


0.295 


0.836 


16.0 


0.401 


1.27 


0.941 


0.312 


0.920 


30.0 


0.184 


1.24 


0.945 


0.303 


0.924 



Table 3. Moments of total mass and galaxy mass density distributions for each quartile. 



Top hat radius (h 1 Mpc) Redshift Range b = a g /a r = (55 g )/aa g crb/a y o^/o^vT — 



1.0 


1.9 — oo 


3.58 


0.897 


0.372 


0.844 




1.1 — 1.9 


3.29 


0.896 


0.394 


0.886 




0.6 — 1.1 


2.67 


0.745 


0.558 


0.836 




— 0.6 


2.13 


0.524 


0.687 


0.807 


30.0 


1.9 — oo 


1.65 


0.990 


0.139 


0.981 




1.1 — 1.9 


1.56 


0.979 


0.188 


0.928 




0.6 — 1.1 


1.25 


0.882 


0.437 


0.929 




— 0.6 


0.834 


0.502 


0.817 


0.944 



Table 4. Reduction in the standard deviation o^/o" for various choices of Xj. 



Local Temperature: T 0.70 

Dark Matter Velocity Dispersion: (v 2 ) 0.68 

Velocity Shear: £/# 2 (l + 5) 0.96 

Oblateness (1 hr l Mpc spheres): (Ai + A 2 )/2A 3 - 1 0.97 

Prolateness (1 hr l Mpc spheres): 2Ai/(A 2 + A 3 ) - 1 0.97 
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Fig. la. — A slice through our simulation 50 h^ 1 Mpc on a side (one half the total box 
length). As labeled, the quadrants show the fractional overdensity 5 in dark matter, in the 
galaxies, in the young galaxies, and in the old galaxies, in clockwise order. The stretch is 
logarithmic and is set the same in all quadrants. The fields are smoothed with a 1 hr l Mpc 
Gaussian filter. The scale bar indicates 10 hr l Mpc. Note the large voids in the galaxy 
distribution and the reduction in the fraction of young galaxies in the large overdensity near 
the center. 
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Fig. lb. — Same as Figure [TaL now using 10 h Mpc Gaussian smoothing. It is apparent 
now that on these scales the oldest galaxies are highly biased and that the young galaxies 
are poorly correlated with the mass. They appear to be overdense along the filaments but 
underdense in the clusters. The young galaxies appear quite different than one would guess 
based on Figure [la], but note that that figure has a logarithmic stretch and that structure 
outside the two-dimensional slice contributes significantly to the smoothed field in this figure. 
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10° 10 1 10 2 1 3 1 1 10° 10 1 10 2 




10° 10° 

Mass Density =1+8 

Fig. 2. — Galaxy mass density as a function of mass density for a variety of top hat smoothing 
radii. The shading is a logarithmic stretch of the conditional probability P(l + S g \l + 5); 
thus, each column is normalized separately. The solid black lines indicate (1 + 5 g \l + 5); the 
dotted black lines indicate the la deviation from the mean. 
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Fig. 3a. — Galaxy mass density as a function of dark matter density for each age quartile. 
Smoothing filter is a 1 h~ x Mpc radius top hat sphere. In each panel we list the range of 
formation redshifts included. The plot is of the same form as Figure EL 
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Fig. 3b. — Same as Figure BaL with a 30 h 1 Mpc radius top hat smoothing filter. 
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Fig. 4. — The bias b(R) = a g (R)/a(R). The solid line indicates all the galaxies. The 
dashed lines indicate the age quartiles, with range of formation redshifts listed. Note the 
strong scale-dependence, and the fact that old galaxies are more biased than young galaxies. 



-34- 




Fig. 5. — The correlation coefficient r(R) = (S g S) / 'a g (R)a(R) as defined in the text. The 
lines have the same meanings as in Figure [|. The youngest galaxies are poorly correlated 
with the underlying mass distribution at all scales. For the oldest galaxies, the decline at 
small scales of the correlation coefficient is probably mainly due to the nonlinearity of (S g \S). 
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Fig. 6. — The bias b(R) for various bias models compared with b(R) for the galaxy particles 
in the simulations. As labeled, the top panel refers to all the galaxies, the middle panel to 
the oldest quartile of galaxies, and the bottom panel to the youngest quartile of galaxies. 
The solid lines are for the actual galaxy particles. The dotted lines are for a galaxy field 
defined by (5 g \S), with a 1 h' 1 Mpc tophat smoothing. The short dashed lines are for a 
galaxy field defined by (6 g \6,T). The long dashed lines are for a galaxy field defined by the 
fit to (S g \S,T) given in Equation || The dot-dashed line is for a galaxy field defined by the 
toy model in Equation fll3|) , with M' = 2.4 and N' = —0.4. All the two- variable models 
reproduce the scale-dependence much better than does the single-variable model. 
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Fig. 7. — The correlations between the density field 5 and the residuals e = 5 g — (5 g \5) 
defined on 1 hr l Mpc scales, as a function of scale. The solid curve is for all galaxies, and 
the dashed are for the oldest and youngest quartiles, as labeled. As explained in the text, the 
correlation of residuals with the large scale field is an indication that b(R) will be a function 
of scale. Also note that the dependence is stronger for the young galaxies than for the old. 
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10 4 10 5 10 6 10 7 
Temperature (°K) 

Fig. 8. — Dependence of galaxy density 1 + S g on dark matter density 1 + 5 and temperature 
T, evaluated at 1 h' 1 Mpc smoothing. The greyscale is a logarithmic stretch of 1 + 5 g ; the 
contours are in even logarithmic intervals. 
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Fig. 9. — The dependence of p g j (p g ) = 1 + 5 g on T at fixed S for all the galaxies (solid 
line) and each quartile (dashed lines, as labeled). Note that (1 + 5 g ) varies over an order of 
magnitude in this temperature range. Also, note that the young galaxies have the strongest 
dependence on temperature and the oldest galaxies have the weakest. 
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Fig. 10. — The dependence of p g j (p g ) = 1 + 5 g on T at a series of fixed 6 for all the galaxies. 
The solid lines are the actual dependence of the galaxies in the simulations found in Figure 
|[ The dotted lines are from the fit in Equation @, with the parameters L = 1.23, M = 1.9, 
and N = -0.66. 



-40- 




Fig. 11. — Comparison of the simple model for the temperature-density cross spectrum 
given in Equation [12] with the cross spectrum measured in the simulations. The solid line is 
from the simulations and the dashed line is the model, for r n \ = 16 h" 1 Mpc. 
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Fig. 12. — The linear parameters M' and AT' of Equation flI3|), as a function of scale. We 
calculated these using a linear regression on 5 and <5t- Note the scale independence of this 
model, as compared to the scale dependence in the single- variable model parameter b(R), 
shown in Figure |j. 



